The CAI Analyser Package: Inferring Gene Expressivity from Raw Genomic Data

Authors

  • Matteo Ramazzotti
  • Matteo Brilli
  • Renato Fani
  • Giampaolo Manao
  • Donatella Degl'Innocenti
Abstract

The Codon Adaptation Index (CAI) was introduced by Sharp and Li in 1987 to quantify the similarity in codon usage between a coding sequence and a set of reference sequences. When an amino acid is encoded by several synonymous codons, highly expressed genes tend to prefer some of them, in line with tRNA abundance and thermodynamic factors. Several authors have described CAI-based computational methods to derive expressivity measures for all genes in a genome. Here we present CAIAP (CAI Analyser Package), a platform-independent package of computer programs for calculating the CAI and studying gene expressivity in depth, starting from raw gene sequences. Our approach implements and optimizes a procedure that derives the reference sequences from whole genomes and uses their codon usage for CAI estimation. In addition, a set of analysis tools is provided to perform statistical analyses and thereby make the results more robust.

Objective: Our aim was to produce an easy-to-use, fully automatic set of programs specifically designed for analysing gene expressivity and for inter-species comparisons across a large number of genomes. The output also integrates information from functional annotations of genes. We maintain a web server that stores our analyses for hundreds of genomes and supports intergenomic comparison of data through dedicated search engines. The CAIAP server is hosted at www4.unifi.it/scibio/bioinfo/caiap/html. The programs (maintained as Perl scripts) are also available for download at the same location.
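To make the CAI definition concrete, the following is a minimal Python sketch of the index as commonly described after Sharp and Li: each codon gets a relative adaptiveness equal to its count in the reference set divided by the count of the most used synonymous codon, and the CAI of a gene is the geometric mean of these values over its codons. It is not the CAIAP implementation (which is distributed as Perl scripts), and it does not cover the derivation of reference sequences from whole genomes. The function names, the use of the standard genetic code, and the 0.5 pseudocount for codons unseen in the reference set are assumptions made for illustration.

```python
import math
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons(seq):
    """Split a coding sequence into in-frame codons, dropping a trailing partial codon."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def relative_adaptiveness(reference_seqs):
    """Compute w = count(codon) / max count among its synonymous codons.

    Stop codons and single-codon amino acids (Met, Trp) are skipped.
    """
    counts = Counter(
        c for s in reference_seqs for c in codons(s) if c in CODON_TABLE
    )
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    w = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue
        best = max(counts[c] for c in family)
        if best == 0:
            continue  # amino acid absent from the reference set
        for c in family:
            # Assumption: unseen codons get a pseudocount of 0.5 to avoid log(0).
            w[c] = max(counts[c], 0.5) / best
    return w


def cai(seq, w):
    """Geometric mean of w over the codons of seq that have a defined w."""
    logs = [math.log(w[c]) for c in codons(seq) if c in w]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Toy reference set (hypothetical): Ala = GCT x3, GCC x1; Lys = AAA x2, AAG x1.
weights = relative_adaptiveness(["GCTGCTGCTGCCAAAAAAAAG"])
print(round(cai("GCTAAA", weights), 3))
print(round(cai("GCCAAG", weights), 3))
```

A gene that uses only the codons preferred by the reference set scores close to 1, while a gene that uses rarer synonymous codons scores lower. In practice the reference set would be a collection of highly expressed genes rather than a toy string.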


Similar resources

predictionet: a package for inferring predictive networks from high-dimensional genomic data

Contents excerpt: 2 Biology and Data; 2.1 RAS signaling pathway; 2.2 Colon cancer gene expression data; 2.3 Known gene interactions extracted from the biomedical literature and public structured databases; 2.4 Predictionet package ...


Automated Scanning Electron Microscope Based Mineral Liberation Analysis: An Introduction to JKMRC/FEI Mineral Liberation Analyser

This paper presents the methods and techniques used in the recently developed JKMRC/FEI Mineral Liberation Analyser (MLA). The MLA system consists of a specially developed software package and a standard modern SEM fitted with an energy dispersive spectrum (EDS) analyser. The on-line program of the MLA software package controls the SEM, captures sample images, performs necessary image analysis ...


pyGeno: A Python package for precision medicine and proteogenomics [version 1; referees: awaiting peer review]

pyGeno is a Python package mainly intended for precision medicine applications that revolve around genomics and proteomics. It integrates reference sequences and annotations from Ensembl, genomic polymorphisms from the dbSNP database and data from next-gen sequencing into an easy-to-use, memory-efficient and fast framework, therefore allowing the user to easily explore subject-specific genomes ...



GENOTRACE: cDNA-based local GENOme assembly from TRACE archives

GENOTRACE identifies the genomic organization of a cDNA using raw data from genome sequencing projects in progress (trace archives). Local genomic contigs are generated, which allows, for example, the design of PCR primers in intronic sequences to amplify the coding regions of a gene, as needed for mutation or SNP detection. Availability: The package and examples of output files can...



Journal:
  • In silico biology

Volume 7, Issue 4-5

Pages: -

Publication date: 2007